Dataset statistics
| Number of variables | 11 |
|---|---|
| Number of observations | 922 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 79.4 KiB |
| Average record size in memory | 88.1 B |
Variable types
| Numeric | 7 |
|---|---|
| Categorical | 4 |
Alerts
df_index is highly correlated with Payment Method Granted and 2 other fields | High correlation |
Credit Limit Granted is highly correlated with Commercial Risk Cover Protected and 4 other fields | High correlation |
Commercial Risk Group Code is highly correlated with Credit Limit Granted and 4 other fields | High correlation |
IDCliente is highly correlated with df_index and 2 other fields | High correlation |
VENTASCLIENTE is highly correlated with NArticulos and 1 other field | High correlation |
NArticulos is highly correlated with VENTASCLIENTE and 1 other field | High correlation |
NContratos is highly correlated with VENTASCLIENTE and 1 other field | High correlation |
Commercial Risk Cover Protected is highly correlated with Credit Limit Granted and 4 other fields | High correlation |
Payment Method Granted is highly correlated with df_index and 6 other fields | High correlation |
Payment Terms Granted is highly correlated with df_index and 6 other fields | High correlation |
Status Code is highly correlated with Credit Limit Granted and 4 other fields | High correlation |
df_index has unique values | Unique |
IDCliente has unique values | Unique |
Credit Limit Granted has 492 (53.4%) zeros | Zeros |
Commercial Risk Group Code has 533 (57.8%) zeros | Zeros |
Reproduction
| Analysis started | 2022-11-01 18:03:45.133829 |
|---|---|
| Analysis finished | 2022-11-01 18:03:55.189431 |
| Duration | 10.06 seconds |
| Software version | pandas-profiling v3.4.0 |
| Download configuration | config.json |
df_index
| Distinct | 922 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1514.378525 |
| Minimum | 1 |
|---|---|
| Maximum | 3891 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.3 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 92.05 |
| Q1 | 439.25 |
| median | 1204.5 |
| Q3 | 2458 |
| 95-th percentile | 3606.65 |
| Maximum | 3891 |
| Range | 3890 |
| Interquartile range (IQR) | 2018.75 |
Descriptive statistics
| Standard deviation | 1178.149944 |
|---|---|
| Coefficient of variation (CV) | 0.7779758658 |
| Kurtosis | -1.122984452 |
| Mean | 1514.378525 |
| Median Absolute Deviation (MAD) | 911 |
| Skewness | 0.4755752134 |
| Sum | 1396257 |
| Variance | 1388037.291 |
| Monotonicity | Strictly increasing |
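Several rows in the tables above are derived from others: the range is maximum minus minimum, the IQR is Q3 minus Q1, the coefficient of variation is the standard deviation divided by the mean, and the variance is the squared standard deviation. The sketch below recomputes them from the reported figures as a consistency check. The function name `derived_stats` is illustrative and is not part of pandas-profiling.

```python
# Values copied from the summary tables directly above.
minimum, maximum = 1, 3891
q1, q3 = 439.25, 2458
std = 1178.149944
total, n_rows = 1396257, 922  # reported Sum and number of observations


def derived_stats(minimum, maximum, q1, q3, std, total, n_rows):
    """Recompute the derived summary rows from the primary ones."""
    mean = total / n_rows
    return {
        "range": maximum - minimum,
        "iqr": q3 - q1,            # interquartile range
        "mean": mean,
        "cv": std / mean,          # coefficient of variation
        "variance": std ** 2,
    }


stats = derived_stats(minimum, maximum, q1, q3, std, total, n_rows)
for name, value in stats.items():
    print(f"{name}: {value}")
```

Small differences in the last printed digits are expected, because the inputs are already rounded in the report.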
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 1 | 1 | 0.1% |
| 2066 | 1 | 0.1% |
| 2080 | 1 | 0.1% |
| 2081 | 1 | 0.1% |
| 2088 | 1 | 0.1% |
| 2090 | 1 | 0.1% |
| 2096 | 1 | 0.1% |
| 2101 | 1 | 0.1% |
| 2103 | 1 | 0.1% |
| 2106 | 1 | 0.1% |
| Other values (912) | 912 |
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 3 | 1 | |
| 4 | 1 | |
| 5 | 1 | |
| 8 | 1 | |
| 9 | 1 | |
| 10 | 1 | |
| 12 | 1 | |
| 13 | 1 | |
| 15 | 1 |
| Value | Count | Frequency (%) |
| 3891 | 1 | |
| 3888 | 1 | |
| 3886 | 1 | |
| 3875 | 1 | |
| 3874 | 1 | |
| 3859 | 1 | |
| 3855 | 1 | |
| 3845 | 1 | |
| 3833 | 1 | |
| 3832 | 1 |
Credit Limit Granted
| Distinct | 16 |
|---|---|
| Distinct (%) | 1.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 16419.7397 |
| Minimum | 0 |
|---|---|
| Maximum | 45000 |
| Zeros | 492 |
| Zeros (%) | 53.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 45000 |
| 95-th percentile | 45000 |
| Maximum | 45000 |
| Range | 45000 |
| Interquartile range (IQR) | 45000 |
Descriptive statistics
| Standard deviation | 20070.84523 |
|---|---|
| Coefficient of variation (CV) | 1.222360744 |
| Kurtosis | -1.505293659 |
| Mean | 16419.7397 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 0.5909489081 |
| Sum | 15139000 |
| Variance | 402838828.2 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=16)
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 492 | 53.4% |
| 45000 | 278 | 30.2% |
| 10000 | 43 | 4.7% |
| 20000 | 27 | 2.9% |
| 25000 | 18 | 2.0% |
| 15000 | 13 | 1.4% |
| 5000 | 11 | 1.2% |
| 8000 | 7 | 0.8% |
| 28000 | 6 | 0.7% |
| 32000 | 5 | 0.5% |
| Other values (6) | 22 | 2.4% |
| Value | Count | Frequency (%) |
| 0 | 492 | |
| 5000 | 11 | 1.2% |
| 8000 | 7 | 0.8% |
| 10000 | 43 | 4.7% |
| 12000 | 3 | 0.3% |
| 15000 | 13 | 1.4% |
| 18000 | 4 | 0.4% |
| 20000 | 27 | 2.9% |
| 22000 | 4 | 0.4% |
| 25000 | 18 | 2.0% |
| Value | Count | Frequency (%) |
| 45000 | 278 | |
| 38000 | 3 | 0.3% |
| 35000 | 5 | 0.5% |
| 32000 | 5 | 0.5% |
| 30000 | 3 | 0.3% |
| 28000 | 6 | 0.7% |
| 25000 | 18 | 2.0% |
| 22000 | 4 | 0.4% |
| 20000 | 27 | 2.9% |
| 18000 | 4 | 0.4% |
Commercial Risk Cover Protected
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.3 KiB |
| Value | Count |
|---|---|
| 0.0 | 492 |
| 95.0 | 424 |
| 50.0 | 6 |
Length
| Max length | 4 |
|---|---|
| Median length | 3 |
| Mean length | 3.46637744 |
| Min length | 3 |
Characters and Unicode
| Total characters | 3196 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 95.0 |
| 4th row | 0.0 |
| 5th row | 95.0 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 0.0 | 492 | 53.4% |
| 95.0 | 424 | 46.0% |
| 50.0 | 6 | 0.7% |
Length
Histogram of lengths of the category
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0.0 | 492 | |
| 95.0 | 424 | |
| 50.0 | 6 | 0.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1420 | |
| . | 922 | |
| 5 | 430 | 13.5% |
| 9 | 424 | 13.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2274 | |
| Other Punctuation | 922 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1420 | |
| 5 | 430 | 18.9% |
| 9 | 424 | 18.6% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 922 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3196 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 1420 | |
| . | 922 | |
| 5 | 430 | 13.5% |
| 9 | 424 | 13.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3196 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 1420 | |
| . | 922 | |
| 5 | 430 | 13.5% |
| 9 | 424 | 13.3% |
Commercial Risk Group Code
| Distinct | 8 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.622559653 |
| Minimum | 0 |
|---|---|
| Maximum | 7 |
| Zeros | 533 |
| Zeros (%) | 57.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 4 |
| 95-th percentile | 6 |
| Maximum | 7 |
| Range | 7 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 2.167435116 |
|---|---|
| Coefficient of variation (CV) | 1.335812284 |
| Kurtosis | -0.8030320798 |
| Mean | 1.622559653 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 0.8815205815 |
| Sum | 1496 |
| Variance | 4.697774983 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=8)
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 533 | 57.8% |
| 5 | 120 | 13.0% |
| 4 | 71 | 7.7% |
| 2 | 55 | 6.0% |
| 3 | 54 | 5.9% |
| 1 | 41 | 4.4% |
| 6 | 37 | 4.0% |
| 7 | 11 | 1.2% |
| Value | Count | Frequency (%) |
| 0 | 533 | |
| 1 | 41 | 4.4% |
| 2 | 55 | 6.0% |
| 3 | 54 | 5.9% |
| 4 | 71 | 7.7% |
| 5 | 120 | 13.0% |
| 6 | 37 | 4.0% |
| 7 | 11 | 1.2% |
| Value | Count | Frequency (%) |
| 7 | 11 | 1.2% |
| 6 | 37 | 4.0% |
| 5 | 120 | 13.0% |
| 4 | 71 | 7.7% |
| 3 | 54 | 5.9% |
| 2 | 55 | 6.0% |
| 1 | 41 | 4.4% |
| 0 | 533 |
Payment Method Granted
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.3 KiB |
| Value | Count |
|---|---|
| 0.0 | 492 |
| 99.0 | 430 |
Length
| Max length | 4 |
|---|---|
| Median length | 3 |
| Mean length | 3.46637744 |
| Min length | 3 |
Characters and Unicode
| Total characters | 3196 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 99.0 |
| 4th row | 0.0 |
| 5th row | 99.0 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 0.0 | 492 | 53.4% |
| 99.0 | 430 | 46.6% |
Length
Histogram of lengths of the category
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0.0 | 492 | |
| 99.0 | 430 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1414 | |
| . | 922 | |
| 9 | 860 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2274 | |
| Other Punctuation | 922 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1414 | |
| 9 | 860 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 922 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3196 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 1414 | |
| . | 922 | |
| 9 | 860 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3196 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 1414 | |
| . | 922 | |
| 9 | 860 |
Payment Terms Granted
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.3 KiB |
| Value | Count |
|---|---|
| 0.0 | 492 |
| 180.0 | 430 |
Length
| Max length | 5 |
|---|---|
| Median length | 3 |
| Mean length | 3.932754881 |
| Min length | 3 |
Characters and Unicode
| Total characters | 3626 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 180.0 |
| 4th row | 0.0 |
| 5th row | 180.0 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 0.0 | 492 | 53.4% |
| 180.0 | 430 | 46.6% |
Length
Histogram of lengths of the category
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0.0 | 492 | |
| 180.0 | 430 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1844 | |
| . | 922 | |
| 1 | 430 | 11.9% |
| 8 | 430 | 11.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2704 | |
| Other Punctuation | 922 | 25.4% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1844 | |
| 1 | 430 | 15.9% |
| 8 | 430 | 15.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 922 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3626 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 1844 | |
| . | 922 | |
| 1 | 430 | 11.9% |
| 8 | 430 | 11.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3626 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 1844 | |
| . | 922 | |
| 1 | 430 | 11.9% |
| 8 | 430 | 11.9% |
Status Code
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.3 KiB |
| Value | Count |
|---|---|
| 2.0 | 492 |
| 66.0 | 425 |
| 8.0 | 5 |
Length
| Max length | 4 |
|---|---|
| Median length | 3 |
| Mean length | 3.460954447 |
| Min length | 3 |
Characters and Unicode
| Total characters | 3191 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2.0 |
|---|---|
| 2nd row | 2.0 |
| 3rd row | 66.0 |
| 4th row | 2.0 |
| 5th row | 66.0 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 2.0 | 492 | 53.4% |
| 66.0 | 425 | 46.1% |
| 8.0 | 5 | 0.5% |
Length
Histogram of lengths of the category
Category Frequency Plot
| Value | Count | Frequency (%) |
| 2.0 | 492 | |
| 66.0 | 425 | |
| 8.0 | 5 | 0.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 922 | |
| 0 | 922 | |
| 6 | 850 | |
| 2 | 492 | |
| 8 | 5 | 0.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2269 | |
| Other Punctuation | 922 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 922 | |
| 6 | 850 | |
| 2 | 492 | |
| 8 | 5 | 0.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 922 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3191 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| . | 922 | |
| 0 | 922 | |
| 6 | 850 | |
| 2 | 492 | |
| 8 | 5 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3191 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 922 | |
| 0 | 922 | |
| 6 | 850 | |
| 2 | 492 | |
| 8 | 5 | 0.2% |
IDCliente
| Distinct | 922 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 26380795.55 |
| Minimum | 110007 |
|---|---|
| Maximum | 63500002 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.3 KiB |
Quantile statistics
| Minimum | 110007 |
|---|---|
| 5-th percentile | 1340502 |
| Q1 | 7087505 |
| median | 24745004 |
| Q3 | 41877503.25 |
| 95-th percentile | 59105502.1 |
| Maximum | 63500002 |
| Range | 63389995 |
| Interquartile range (IQR) | 34789998.25 |
Descriptive statistics
| Standard deviation | 19359415.68 |
|---|---|
| Coefficient of variation (CV) | 0.7338450291 |
| Kurtosis | -1.225196293 |
| Mean | 26380795.55 |
| Median Absolute Deviation (MAD) | 17580000 |
| Skewness | 0.2746874114 |
| Sum | 2.432309349 × 10^10 |
| Variance | 3.747869753 × 10^14 |
| Monotonicity | Strictly increasing |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 110007 | 1 | 0.1% |
| 36230002 | 1 | 0.1% |
| 36420003 | 1 | 0.1% |
| 36430003 | 1 | 0.1% |
| 36510002 | 1 | 0.1% |
| 36540002 | 1 | 0.1% |
| 36660006 | 1 | 0.1% |
| 36710002 | 1 | 0.1% |
| 36730004 | 1 | 0.1% |
| 36770003 | 1 | 0.1% |
| Other values (912) | 912 |
| Value | Count | Frequency (%) |
| 110007 | 1 | |
| 130030 | 1 | |
| 140010 | 1 | |
| 150003 | 1 | |
| 180003 | 1 | |
| 190002 | 1 | |
| 200004 | 1 | |
| 220005 | 1 | |
| 230004 | 1 | |
| 250008 | 1 |
| Value | Count | Frequency (%) |
| 63500002 | 1 | |
| 63450002 | 1 | |
| 63420002 | 1 | |
| 63240002 | 1 | |
| 63220002 | 1 | |
| 62960002 | 1 | |
| 62860002 | 1 | |
| 62700002 | 1 | |
| 62480002 | 1 | |
| 62470002 | 1 |
VENTASCLIENTE
| Distinct | 891 |
|---|---|
| Distinct (%) | 96.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 729.5120912 |
| Minimum | 50.25 |
|---|---|
| Maximum | 4743.43 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.3 KiB |
Quantile statistics
| Minimum | 50.25 |
|---|---|
| 5-th percentile | 71.71 |
| Q1 | 205.45 |
| median | 394.79 |
| Q3 | 927.075625 |
| 95-th percentile | 2489.53485 |
| Maximum | 4743.43 |
| Range | 4693.18 |
| Interquartile range (IQR) | 721.625625 |
Descriptive statistics
| Standard deviation | 822.9740137 |
|---|---|
| Coefficient of variation (CV) | 1.128115659 |
| Kurtosis | 4.494816529 |
| Mean | 729.5120912 |
| Median Absolute Deviation (MAD) | 251.6275 |
| Skewness | 2.083626812 |
| Sum | 672610.1481 |
| Variance | 677286.2272 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 225.45 | 3 | 0.3% |
| 205.45 | 3 | 0.3% |
| 71.71 | 3 | 0.3% |
| 90.9 | 3 | 0.3% |
| 178.885 | 2 | 0.2% |
| 173.72 | 2 | 0.2% |
| 52.12 | 2 | 0.2% |
| 59.59 | 2 | 0.2% |
| 121.2 | 2 | 0.2% |
| 110.09 | 2 | 0.2% |
| Other values (881) | 898 |
| Value | Count | Frequency (%) |
| 50.25 | 1 | |
| 50.5 | 2 | |
| 50.88 | 2 | |
| 51.51 | 1 | |
| 51.616 | 1 | |
| 52.12 | 2 | |
| 52.52 | 1 | |
| 53.28 | 1 | |
| 53.6512 | 1 | |
| 54.237 | 1 |
| Value | Count | Frequency (%) |
| 4743.43 | 1 | |
| 4493.27 | 1 | |
| 4363.853 | 1 | |
| 4265.42 | 1 | |
| 4242.75 | 1 | |
| 4011.915 | 1 | |
| 4008.7124 | 1 | |
| 3989.625 | 1 | |
| 3958.899 | 1 | |
| 3900.3345 | 1 |
NArticulos
| Distinct | 346 |
|---|---|
| Distinct (%) | 37.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 64.38088937 |
| Minimum | 1 |
|---|---|
| Maximum | 471.75 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.3 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 4.05 |
| Q1 | 11 |
| median | 29 |
| Q3 | 78.78125 |
| 95-th percentile | 254.9375 |
| Maximum | 471.75 |
| Range | 470.75 |
| Interquartile range (IQR) | 67.78125 |
Descriptive statistics
| Standard deviation | 85.00419076 |
|---|---|
| Coefficient of variation (CV) | 1.320332658 |
| Kurtosis | 5.391693684 |
| Mean | 64.38088937 |
| Median Absolute Deviation (MAD) | 23 |
| Skewness | 2.276235304 |
| Sum | 59359.18 |
| Variance | 7225.712446 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 5 | 76 | 8.2% |
| 6 | 25 | 2.7% |
| 4 | 23 | 2.5% |
| 3 | 22 | 2.4% |
| 13 | 21 | 2.3% |
| 12 | 20 | 2.2% |
| 7 | 20 | 2.2% |
| 9 | 20 | 2.2% |
| 10 | 17 | 1.8% |
| 8 | 16 | 1.7% |
| Other values (336) | 662 |
| Value | Count | Frequency (%) |
| 1 | 1 | 0.1% |
| 2 | 1 | 0.1% |
| 3 | 22 | 2.4% |
| 4 | 23 | 2.5% |
| 5 | 76 | |
| 6 | 25 | 2.7% |
| 7 | 20 | 2.2% |
| 8 | 16 | 1.7% |
| 9 | 20 | 2.2% |
| 10 | 17 | 1.8% |
| Value | Count | Frequency (%) |
| 471.75 | 1 | |
| 459.75 | 1 | |
| 442.25 | 1 | |
| 439 | 1 | |
| 438 | 1 | |
| 436 | 1 | |
| 425.5 | 1 | |
| 417.5 | 1 | |
| 415.75 | 1 | |
| 405 | 1 |
NContratos
| Distinct | 127 |
|---|---|
| Distinct (%) | 13.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 27.52711497 |
| Minimum | 1 |
|---|---|
| Maximum | 179 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.3 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 4 |
| Q1 | 7 |
| median | 15 |
| Q3 | 33 |
| 95-th percentile | 98 |
| Maximum | 179 |
| Range | 178 |
| Interquartile range (IQR) | 26 |
Descriptive statistics
| Standard deviation | 32.30449964 |
|---|---|
| Coefficient of variation (CV) | 1.173551957 |
| Kurtosis | 5.234898291 |
| Mean | 27.52711497 |
| Median Absolute Deviation (MAD) | 10 |
| Skewness | 2.256570701 |
| Sum | 25380 |
| Variance | 1043.580697 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 5 | 87 | 9.4% |
| 6 | 76 | 8.2% |
| 10 | 40 | 4.3% |
| 7 | 38 | 4.1% |
| 4 | 36 | 3.9% |
| 12 | 31 | 3.4% |
| 9 | 30 | 3.3% |
| 16 | 28 | 3.0% |
| 13 | 25 | 2.7% |
| 11 | 24 | 2.6% |
| Other values (117) | 507 |
| Value | Count | Frequency (%) |
| 1 | 1 | 0.1% |
| 2 | 2 | 0.2% |
| 3 | 23 | 2.5% |
| 4 | 36 | |
| 5 | 87 | |
| 6 | 76 | |
| 7 | 38 | |
| 8 | 21 | 2.3% |
| 9 | 30 | 3.3% |
| 10 | 40 |
| Value | Count | Frequency (%) |
| 179 | 1 | |
| 174 | 1 | |
| 166 | 1 | |
| 165 | 1 | |
| 164 | 1 | |
| 161 | 1 | |
| 160 | 2 | |
| 159 | 1 | |
| 158 | 1 | |
| 153 | 1 |
Auto
The auto setting is an easily interpretable pairwise column metric that selects a method from the variable types of each pair: categorical-categorical uses Cramér's V, numerical-categorical uses Cramér's V (on a discretized numerical column), and numerical-numerical uses Spearman's ρ. This configuration uses the most suitable method for each pair of columns.
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1, with -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation, and +1 indicating total positive monotonic correlation.
To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
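The calculation described above can be sketched in plain Python: rank each variable (tied values share the mean of their positions), then divide the covariance of the ranks by the product of their standard deviations. The helper names are illustrative and this is not pandas-profiling's own implementation. The sample values are the NArticulos and NContratos columns from the first ten rows of the dataset.

```python
from math import sqrt
from statistics import mean


def average_ranks(values):
    """Rank values from 1 to n; tied values get the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values equal to the one at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2 + 1  # mean of the 1-based positions i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = shared
        i = j + 1
    return ranks


def rank_covariance_ratio(x, y):
    """Covariance of x and y over the product of their standard deviations."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)  # the 1/n factors cancel


def spearman(x, y):
    return rank_covariance_ratio(average_ranks(x), average_ranks(y))


n_articulos = [121.0, 373.0, 202.5, 13.0, 12.0, 24.25, 6.0, 39.0, 108.5, 115.0]
n_contratos = [121, 165, 94, 13, 12, 9, 6, 24, 41, 44]
print(round(spearman(n_articulos, n_contratos), 3))
```

Because only the ranks enter the formula, any strictly increasing transformation of either variable leaves ρ unchanged.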
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1, with -1 indicating total negative linear correlation, 0 indicating no linear correlation, and +1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.
To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
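A minimal sketch of that formula, together with a check of the location-and-scale invariance mentioned above. The function name `pearson_r` is illustrative. The sample values are the first five VENTASCLIENTE and NContratos entries from the dataset's first rows.

```python
from math import sqrt
from statistics import mean


def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    var_x = sum((a - mx) ** 2 for a in x)
    var_y = sum((b - my) ** 2 for b in y)
    return cov / sqrt(var_x * var_y)  # the 1/n factors cancel


sales = [830.17, 2402.7208, 2044.0675, 295.34, 239.84]
contracts = [121, 165, 94, 13, 12]

r = pearson_r(sales, contracts)
# Shifting and rescaling one variable (with a positive factor) leaves r unchanged.
r_rescaled = pearson_r([2.5 * s + 100 for s in sales], contracts)
print(r, r_rescaled)
```

A negative scale factor would flip the sign of r while keeping its magnitude.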
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1, with -1 indicating total negative correlation, 0 indicating no correlation, and +1 indicating total positive correlation.
To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
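The pair-counting definition above translates directly into code. The sketch below implements exactly that form (often called τ-a), under the assumption that tied pairs count as neither concordant nor discordant. Tie-adjusted variants such as τ-b exist, and a library may report one of those instead, so results can differ on data with many ties. The function name is illustrative.

```python
from itertools import combinations


def kendall_tau_a(x, y):
    """(concordant - discordant) / total number of pairs."""
    concordant = discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(x, y), 2):
        direction = (x1 - x2) * (y1 - y2)
        if direction > 0:
            concordant += 1   # both variables move the same way
        elif direction < 0:
            discordant += 1   # the variables move in opposite ways
        # direction == 0 is a tie in x or y and counts in neither group
    total_pairs = len(x) * (len(x) - 1) // 2
    return (concordant - discordant) / total_pairs


# Three observations: two concordant pairs and one discordant pair.
print(kendall_tau_a([1, 2, 3], [1, 3, 2]))
```

This direct version compares every pair, so its cost grows quadratically with the number of observations.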
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been shown to be biased, even for large samples. We use the bias-corrected measure proposed by Bergsma in 2013.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient for a bivariate normal input distribution. Extensive documentation is available.
A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
First rows
| | df_index | Credit Limit Granted | Commercial Risk Cover Protected | Commercial Risk Group Code | Payment Method Granted | Payment Terms Granted | Status Code | IDCliente | VENTASCLIENTE | NArticulos | NContratos |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 2.0 | 110007 | 830.1700 | 121.00 | 121 |
| 1 | 3 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 2.0 | 130030 | 2402.7208 | 373.00 | 165 |
| 2 | 4 | 45000.0 | 95.0 | 1.0 | 99.0 | 180.0 | 66.0 | 140010 | 2044.0675 | 202.50 | 94 |
| 3 | 5 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 2.0 | 150003 | 295.3400 | 13.00 | 13 |
| 4 | 8 | 20000.0 | 95.0 | 3.0 | 99.0 | 180.0 | 66.0 | 180003 | 239.8400 | 12.00 | 12 |
| 5 | 9 | 45000.0 | 95.0 | 3.0 | 99.0 | 180.0 | 66.0 | 190002 | 388.8300 | 24.25 | 9 |
| 6 | 10 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 2.0 | 200004 | 154.5300 | 6.00 | 6 |
| 7 | 12 | 45000.0 | 95.0 | 2.0 | 99.0 | 180.0 | 66.0 | 220005 | 339.2500 | 39.00 | 24 |
| 8 | 13 | 45000.0 | 95.0 | 3.0 | 99.0 | 180.0 | 66.0 | 230004 | 591.3400 | 108.50 | 41 |
| 9 | 15 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 2.0 | 250008 | 983.5400 | 115.00 | 44 |
Last rows
| | df_index | Credit Limit Granted | Commercial Risk Cover Protected | Commercial Risk Group Code | Payment Method Granted | Payment Terms Granted | Status Code | IDCliente | VENTASCLIENTE | NArticulos | NContratos |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 912 | 3832 | 45000.0 | 95.0 | 2.0 | 99.0 | 180.0 | 66.0 | 62470002 | 418.50 | 5.0 | 5 |
| 913 | 3833 | 45000.0 | 95.0 | 6.0 | 99.0 | 180.0 | 66.0 | 62480002 | 151.50 | 5.0 | 5 |
| 914 | 3845 | 10000.0 | 95.0 | 6.0 | 99.0 | 180.0 | 66.0 | 62700002 | 133.23 | 14.0 | 12 |
| 915 | 3855 | 45000.0 | 95.0 | 5.0 | 99.0 | 180.0 | 66.0 | 62860002 | 411.88 | 61.0 | 7 |
| 916 | 3859 | 45000.0 | 95.0 | 3.0 | 99.0 | 180.0 | 66.0 | 62960002 | 177.72 | 5.0 | 5 |
| 917 | 3874 | 20000.0 | 95.0 | 5.0 | 99.0 | 180.0 | 66.0 | 63220002 | 225.45 | 5.0 | 5 |
| 918 | 3875 | 38000.0 | 95.0 | 4.0 | 99.0 | 180.0 | 66.0 | 63240002 | 253.74 | 9.0 | 9 |
| 919 | 3886 | 45000.0 | 95.0 | 5.0 | 99.0 | 180.0 | 66.0 | 63420002 | 191.85 | 17.0 | 6 |
| 920 | 3888 | 45000.0 | 95.0 | 3.0 | 99.0 | 180.0 | 66.0 | 63450002 | 426.34 | 62.0 | 10 |
| 921 | 3891 | 45000.0 | 95.0 | 5.0 | 99.0 | 180.0 | 66.0 | 63500002 | 246.54 | 5.0 | 5 |